Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 18905 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.2 MiB |
| Average record size in memory | 124.0 B |
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 1 |
fLength is highly overall correlated with fWidth and 7 other fields | High correlation |
fWidth is highly overall correlated with fLength and 6 other fields | High correlation |
fSize is highly overall correlated with fLength and 4 other fields | High correlation |
fConc is highly overall correlated with fLength and 4 other fields | High correlation |
fConc1 is highly overall correlated with fLength and 4 other fields | High correlation |
fAsym is highly overall correlated with fLength and 2 other fields | High correlation |
fM3Long is highly overall correlated with fLength and 5 other fields | High correlation |
fM3Trans is highly overall correlated with fLength and 1 other fields | High correlation |
fAlpha is highly overall correlated with class | High correlation |
fDist is highly overall correlated with fLength | High correlation |
class is highly overall correlated with fAlpha | High correlation |
Reproduction
| Analysis started | 2022-11-26 18:14:43.929646 |
|---|---|
| Analysis finished | 2022-11-26 18:15:18.931535 |
| Duration | 35 seconds |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
fLength
Real number (ℝ)
| Distinct | 18643 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 53.161416 |
| Minimum | 4.2835 |
|---|---|
| Maximum | 334.177 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 4.2835 |
|---|---|
| 5-th percentile | 16.43074 |
| Q1 | 24.3597 |
| median | 37.1295 |
| Q3 | 69.9754 |
| 95-th percentile | 139.1416 |
| Maximum | 334.177 |
| Range | 329.8935 |
| Interquartile range (IQR) | 45.6157 |
Descriptive statistics
| Standard deviation | 42.259789 |
|---|---|
| Coefficient of variation (CV) | 0.79493348 |
| Kurtosis | 5.031315 |
| Mean | 53.161416 |
| Median Absolute Deviation (MAD) | 16.288 |
| Skewness | 2.0219617 |
| Sum | 1005016.6 |
| Variance | 1785.8898 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19.1572 | 3 | < 0.1% |
| 24.8332 | 3 | < 0.1% |
| 26.9187 | 3 | < 0.1% |
| 31.3405 | 2 | < 0.1% |
| 12.9833 | 2 | < 0.1% |
| 61.6736 | 2 | < 0.1% |
| 20.1648 | 2 | < 0.1% |
| 21.0734 | 2 | < 0.1% |
| 20.9469 | 2 | < 0.1% |
| 24.5405 | 2 | < 0.1% |
| Other values (18633) | 18882 |
| Value | Count | Frequency (%) |
| 4.2835 | 1 | |
| 7.2079 | 1 | |
| 7.3606 | 1 | |
| 8.0518 | 1 | |
| 8.2304 | 1 | |
| 8.2311 | 1 | |
| 8.4802 | 1 | |
| 8.5738 | 1 | |
| 8.601 | 1 | |
| 8.6998 | 1 |
| Value | Count | Frequency (%) |
| 334.177 | 1 | |
| 310.61 | 1 | |
| 305.422 | 1 | |
| 305.324 | 1 | |
| 305.0961 | 1 | |
| 303.5676 | 1 | |
| 303.2787 | 1 | |
| 299.9304 | 1 | |
| 297.1239 | 1 | |
| 295.672 | 1 |
fWidth
Real number (ℝ)
| Distinct | 18200 |
|---|---|
| Distinct (%) | 96.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.145872 |
| Minimum | 0 |
|---|---|
| Maximum | 256.382 |
| Zeros | 98 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7.398 |
| Q1 | 11.8742 |
| median | 17.1438 |
| Q3 | 24.7124 |
| 95-th percentile | 58.34608 |
| Maximum | 256.382 |
| Range | 256.382 |
| Interquartile range (IQR) | 12.8382 |
Descriptive statistics
| Standard deviation | 18.300664 |
|---|---|
| Coefficient of variation (CV) | 0.82636909 |
| Kurtosis | 17.013498 |
| Mean | 22.145872 |
| Median Absolute Deviation (MAD) | 5.8606 |
| Skewness | 3.3945407 |
| Sum | 418667.71 |
| Variance | 334.9143 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 98 | 0.5% |
| 10.7539 | 4 | < 0.1% |
| 15.8644 | 3 | < 0.1% |
| 0.0028 | 3 | < 0.1% |
| 15.0295 | 3 | < 0.1% |
| 20.2021 | 3 | < 0.1% |
| 0.0029 | 3 | < 0.1% |
| 10.5084 | 3 | < 0.1% |
| 10.0342 | 3 | < 0.1% |
| 12.8155 | 3 | < 0.1% |
| Other values (18190) | 18779 |
| Value | Count | Frequency (%) |
| 0 | 98 | |
| 0.0001 | 3 | < 0.1% |
| 0.0002 | 1 | < 0.1% |
| 0.0006 | 1 | < 0.1% |
| 0.0019 | 1 | < 0.1% |
| 0.0025 | 2 | < 0.1% |
| 0.0026 | 2 | < 0.1% |
| 0.0027 | 1 | < 0.1% |
| 0.0028 | 3 | < 0.1% |
| 0.0029 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 256.382 | 1 | |
| 228.0385 | 1 | |
| 220.5144 | 1 | |
| 201.364 | 1 | |
| 190.5432 | 1 | |
| 190.139 | 1 | |
| 188.8866 | 1 | |
| 186.928 | 1 | |
| 179.2924 | 1 | |
| 177.782 | 1 |
fSize
Real number (ℝ)
| Distinct | 7228 |
|---|---|
| Distinct (%) | 38.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8246431 |
| Minimum | 1.9413 |
|---|---|
| Maximum | 5.3233 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 1.9413 |
|---|---|
| 5-th percentile | 2.19418 |
| Q1 | 2.4771 |
| median | 2.74 |
| Q3 | 3.1011 |
| 95-th percentile | 3.71468 |
| Maximum | 5.3233 |
| Range | 3.382 |
| Interquartile range (IQR) | 0.624 |
Descriptive statistics
| Standard deviation | 0.4723766 |
|---|---|
| Coefficient of variation (CV) | 0.16723408 |
| Kurtosis | 0.72315407 |
| Mean | 2.8246431 |
| Median Absolute Deviation (MAD) | 0.2992 |
| Skewness | 0.87304282 |
| Sum | 53399.878 |
| Variance | 0.22313965 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.1508 | 27 | 0.1% |
| 2.0774 | 24 | 0.1% |
| 2.1287 | 24 | 0.1% |
| 2.1319 | 23 | 0.1% |
| 2.3139 | 22 | 0.1% |
| 2.1414 | 22 | 0.1% |
| 2.1351 | 22 | 0.1% |
| 2.29 | 21 | 0.1% |
| 2.1717 | 20 | 0.1% |
| 2.3483 | 20 | 0.1% |
| Other values (7218) | 18680 |
| Value | Count | Frequency (%) |
| 1.9413 | 1 | < 0.1% |
| 1.9468 | 1 | < 0.1% |
| 1.9916 | 1 | < 0.1% |
| 1.9978 | 1 | < 0.1% |
| 2.0022 | 1 | < 0.1% |
| 2.0065 | 2 | < 0.1% |
| 2.0107 | 3 | < 0.1% |
| 2.0149 | 4 | |
| 2.0191 | 1 | < 0.1% |
| 2.0233 | 8 |
| Value | Count | Frequency (%) |
| 5.3233 | 1 | |
| 5.1795 | 1 | |
| 5.1467 | 1 | |
| 5.0118 | 1 | |
| 5.01 | 1 | |
| 4.9946 | 1 | |
| 4.9518 | 1 | |
| 4.9369 | 1 | |
| 4.905 | 1 | |
| 4.8501 | 1 |
fConc
Real number (ℝ)
| Distinct | 6410 |
|---|---|
| Distinct (%) | 33.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.38024713 |
| Minimum | 0.0131 |
|---|---|
| Maximum | 0.893 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 0.0131 |
|---|---|
| 5-th percentile | 0.12652 |
| Q1 | 0.2358 |
| median | 0.354 |
| Q3 | 0.5035 |
| 95-th percentile | 0.7342 |
| Maximum | 0.893 |
| Range | 0.8799 |
| Interquartile range (IQR) | 0.2677 |
Descriptive statistics
| Standard deviation | 0.18270933 |
|---|---|
| Coefficient of variation (CV) | 0.48050155 |
| Kurtosis | -0.51687105 |
| Mean | 0.38024713 |
| Median Absolute Deviation (MAD) | 0.13 |
| Skewness | 0.48853997 |
| Sum | 7188.5719 |
| Variance | 0.033382701 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6 | 15 | 0.1% |
| 0.4116 | 12 | 0.1% |
| 0.2979 | 12 | 0.1% |
| 0.4 | 12 | 0.1% |
| 0.2214 | 11 | 0.1% |
| 0.2175 | 11 | 0.1% |
| 0.5 | 11 | 0.1% |
| 0.6154 | 11 | 0.1% |
| 0.193 | 11 | 0.1% |
| 0.2802 | 10 | 0.1% |
| Other values (6400) | 18789 |
| Value | Count | Frequency (%) |
| 0.0131 | 1 | |
| 0.0133 | 1 | |
| 0.0137 | 1 | |
| 0.0139 | 2 | |
| 0.0158 | 1 | |
| 0.0162 | 1 | |
| 0.0171 | 1 | |
| 0.0188 | 1 | |
| 0.0196 | 1 | |
| 0.0206 | 1 |
| Value | Count | Frequency (%) |
| 0.893 | 1 | |
| 0.8912 | 1 | |
| 0.8889 | 1 | |
| 0.8846 | 1 | |
| 0.8786 | 1 | |
| 0.8778 | 1 | |
| 0.8772 | 1 | |
| 0.8757 | 1 | |
| 0.8745 | 1 | |
| 0.8743 | 1 |
fConc1
Real number (ℝ)
| Distinct | 4421 |
|---|---|
| Distinct (%) | 23.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.21455974 |
| Minimum | 0.0003 |
|---|---|
| Maximum | 0.6752 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 0.0003 |
|---|---|
| 5-th percentile | 0.0671 |
| Q1 | 0.1285 |
| median | 0.1964 |
| Q3 | 0.285 |
| 95-th percentile | 0.42208 |
| Maximum | 0.6752 |
| Range | 0.6749 |
| Interquartile range (IQR) | 0.1565 |
Descriptive statistics
| Standard deviation | 0.11038355 |
|---|---|
| Coefficient of variation (CV) | 0.51446536 |
| Kurtosis | 0.031125523 |
| Mean | 0.21455974 |
| Median Absolute Deviation (MAD) | 0.0753 |
| Skewness | 0.68688736 |
| Sum | 4056.2518 |
| Variance | 0.012184528 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.194 | 18 | 0.1% |
| 0.1939 | 16 | 0.1% |
| 0.2126 | 16 | 0.1% |
| 0.2 | 16 | 0.1% |
| 0.217 | 15 | 0.1% |
| 0.2251 | 15 | 0.1% |
| 0.1581 | 14 | 0.1% |
| 0.1279 | 14 | 0.1% |
| 0.1772 | 14 | 0.1% |
| 0.1245 | 14 | 0.1% |
| Other values (4411) | 18753 |
| Value | Count | Frequency (%) |
| 0.0003 | 1 | |
| 0.0008 | 1 | |
| 0.0011 | 1 | |
| 0.0015 | 1 | |
| 0.002 | 1 | |
| 0.0047 | 1 | |
| 0.005 | 1 | |
| 0.0072 | 1 | |
| 0.0073 | 1 | |
| 0.0076 | 1 |
| Value | Count | Frequency (%) |
| 0.6752 | 1 | |
| 0.674 | 1 | |
| 0.643 | 1 | |
| 0.637 | 1 | |
| 0.6296 | 1 | |
| 0.6283 | 1 | |
| 0.6264 | 1 | |
| 0.6242 | 1 | |
| 0.6224 | 1 | |
| 0.6204 | 1 |
fAsym
Real number (ℝ)
| Distinct | 18704 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -4.177867 |
| Minimum | -457.9161 |
|---|---|
| Maximum | 575.2407 |
| Zeros | 40 |
| Zeros (%) | 0.2% |
| Negative | 8380 |
| Negative (%) | 44.3% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | -457.9161 |
|---|---|
| 5-th percentile | -110.8296 |
| Q1 | -20.4791 |
| median | 4.0629 |
| Q3 | 24.1335 |
| 95-th percentile | 65.52956 |
| Maximum | 575.2407 |
| Range | 1033.1568 |
| Interquartile range (IQR) | 44.6126 |
Descriptive statistics
| Standard deviation | 59.010059 |
|---|---|
| Coefficient of variation (CV) | -14.124447 |
| Kurtosis | 8.2314395 |
| Mean | -4.177867 |
| Median Absolute Deviation (MAD) | 21.6645 |
| Skewness | -1.0379255 |
| Sum | -78982.575 |
| Variance | 3482.1871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40 | 0.2% |
| -0.0001 | 7 | < 0.1% |
| 8.8077 | 3 | < 0.1% |
| -0.5062 | 3 | < 0.1% |
| -1.4761 | 3 | < 0.1% |
| 5.2783 | 2 | < 0.1% |
| 21.8701 | 2 | < 0.1% |
| -14.7987 | 2 | < 0.1% |
| -1.2309 | 2 | < 0.1% |
| -25.9338 | 2 | < 0.1% |
| Other values (18694) | 18839 |
| Value | Count | Frequency (%) |
| -457.9161 | 1 | |
| -449.9526 | 1 | |
| -382.594 | 1 | |
| -381.734 | 1 | |
| -378.9457 | 1 | |
| -368.633 | 1 | |
| -363.3382 | 1 | |
| -353.934 | 1 | |
| -353.26 | 1 | |
| -349.757 | 1 |
| Value | Count | Frequency (%) |
| 575.2407 | 1 | |
| 473.0654 | 1 | |
| 464.631 | 1 | |
| 444.401 | 1 | |
| 433.0957 | 1 | |
| 402.925 | 1 | |
| 402.1863 | 1 | |
| 400.284 | 1 | |
| 396.3379 | 1 | |
| 384.3477 | 1 |
fM3Long
Real number (ℝ)
| Distinct | 18693 |
|---|---|
| Distinct (%) | 98.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.618826 |
| Minimum | -331.78 |
|---|---|
| Maximum | 238.321 |
| Zeros | 39 |
| Zeros (%) | 0.2% |
| Negative | 6558 |
| Negative (%) | 34.7% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | -331.78 |
|---|---|
| 5-th percentile | -79.41338 |
| Q1 | -12.7693 |
| median | 15.338 |
| Q3 | 35.8694 |
| 95-th percentile | 82.96334 |
| Maximum | 238.321 |
| Range | 570.101 |
| Interquartile range (IQR) | 48.6387 |
Descriptive statistics
| Standard deviation | 50.900687 |
|---|---|
| Coefficient of variation (CV) | 4.7934381 |
| Kurtosis | 4.7170806 |
| Mean | 10.618826 |
| Median Absolute Deviation (MAD) | 25.3311 |
| Skewness | -1.1296137 |
| Sum | 200748.91 |
| Variance | 2590.88 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 39 | 0.2% |
| -0.0001 | 4 | < 0.1% |
| 16.0747 | 3 | < 0.1% |
| 55.3976 | 2 | < 0.1% |
| -19.9547 | 2 | < 0.1% |
| 10.6638 | 2 | < 0.1% |
| 9.164 | 2 | < 0.1% |
| 40.4022 | 2 | < 0.1% |
| 20.2691 | 2 | < 0.1% |
| 19.839 | 2 | < 0.1% |
| Other values (18683) | 18845 |
| Value | Count | Frequency (%) |
| -331.78 | 1 | |
| -318.3002 | 1 | |
| -297.1717 | 1 | |
| -293.1762 | 1 | |
| -287.5067 | 1 | |
| -287.3636 | 1 | |
| -284.7038 | 1 | |
| -281.9541 | 1 | |
| -281.844 | 1 | |
| -281.435 | 1 |
| Value | Count | Frequency (%) |
| 238.321 | 1 | |
| 231.446 | 1 | |
| 227.8174 | 1 | |
| 226.3506 | 1 | |
| 222.417 | 1 | |
| 217.934 | 1 | |
| 217.624 | 1 | |
| 216.985 | 1 | |
| 215.894 | 1 | |
| 203.863 | 1 |
fM3Trans
Real number (ℝ)
| Distinct | 18390 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.25936414 |
| Minimum | -205.8947 |
|---|---|
| Maximum | 179.851 |
| Zeros | 59 |
| Zeros (%) | 0.3% |
| Negative | 9343 |
| Negative (%) | 49.4% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | -205.8947 |
|---|---|
| 5-th percentile | -25.63852 |
| Q1 | -10.8358 |
| median | 0.75 |
| Q3 | 10.9489 |
| 95-th percentile | 26.89144 |
| Maximum | 179.851 |
| Range | 385.7457 |
| Interquartile range (IQR) | 21.7847 |
Descriptive statistics
| Standard deviation | 20.775268 |
|---|---|
| Coefficient of variation (CV) | 80.100773 |
| Kurtosis | 8.6759437 |
| Mean | 0.25936414 |
| Median Absolute Deviation (MAD) | 10.8861 |
| Skewness | 0.12358922 |
| Sum | 4903.2791 |
| Variance | 431.61177 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 59 | 0.3% |
| -0.0001 | 24 | 0.1% |
| 0.0001 | 18 | 0.1% |
| 11.1602 | 3 | < 0.1% |
| -8.975 | 3 | < 0.1% |
| -5.4454 | 3 | < 0.1% |
| 8.3039 | 2 | < 0.1% |
| 14.2184 | 2 | < 0.1% |
| -9.7914 | 2 | < 0.1% |
| 11.8212 | 2 | < 0.1% |
| Other values (18380) | 18787 |
| Value | Count | Frequency (%) |
| -205.8947 | 1 | |
| -164.14 | 1 | |
| -149.5513 | 1 | |
| -142.5894 | 1 | |
| -142.119 | 1 | |
| -135.5051 | 1 | |
| -134.75 | 1 | |
| -134.395 | 1 | |
| -133.1359 | 1 | |
| -132.416 | 1 |
| Value | Count | Frequency (%) |
| 179.851 | 1 | |
| 170.692 | 1 | |
| 163.2697 | 1 | |
| 154.865 | 1 | |
| 143.8753 | 1 | |
| 139.2361 | 1 | |
| 132.589 | 1 | |
| 132.388 | 1 | |
| 131.5547 | 1 | |
| 130.8545 | 1 |
fAlpha
Real number (ℝ)
| Distinct | 17981 |
|---|---|
| Distinct (%) | 95.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.551644 |
| Minimum | 0 |
|---|---|
| Maximum | 90 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.92724 |
| Q1 | 5.5164 |
| median | 17.533 |
| Q3 | 45.704 |
| 95-th percentile | 80.71102 |
| Maximum | 90 |
| Range | 90 |
| Interquartile range (IQR) | 40.1876 |
Descriptive statistics
| Standard deviation | 26.083055 |
|---|---|
| Coefficient of variation (CV) | 0.94669687 |
| Kurtosis | -0.52103306 |
| Mean | 27.551644 |
| Median Absolute Deviation (MAD) | 14.569 |
| Skewness | 0.85745627 |
| Sum | 520863.83 |
| Variance | 680.32577 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.0002 | 7 | < 0.1% |
| 0 | 5 | < 0.1% |
| 0.256 | 4 | < 0.1% |
| 2.701 | 4 | < 0.1% |
| 1.29 | 4 | < 0.1% |
| 0.804 | 4 | < 0.1% |
| 0.386 | 4 | < 0.1% |
| 2.76 | 4 | < 0.1% |
| 90 | 4 | < 0.1% |
| 3.4161 | 4 | < 0.1% |
| Other values (17971) | 18861 |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 0.0002 | 7 | |
| 0.0003 | 2 | < 0.1% |
| 0.001 | 1 | < 0.1% |
| 0.0031 | 1 | < 0.1% |
| 0.0056 | 1 | < 0.1% |
| 0.0086 | 1 | < 0.1% |
| 0.009 | 1 | < 0.1% |
| 0.0097 | 1 | < 0.1% |
| 0.0103 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 90 | 4 | |
| 89.9798 | 1 | < 0.1% |
| 89.9579 | 1 | < 0.1% |
| 89.9535 | 1 | < 0.1% |
| 89.9528 | 1 | < 0.1% |
| 89.9229 | 1 | < 0.1% |
| 89.9155 | 1 | < 0.1% |
| 89.9087 | 1 | < 0.1% |
| 89.9076 | 1 | < 0.1% |
| 89.9042 | 1 | < 0.1% |
fDist
Real number (ℝ)
| Distinct | 18437 |
|---|---|
| Distinct (%) | 97.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 193.71255 |
| Minimum | 1.2826 |
|---|---|
| Maximum | 495.561 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 295.4 KiB |
Quantile statistics
| Minimum | 1.2826 |
|---|---|
| 5-th percentile | 71.41204 |
| Q1 | 142.269 |
| median | 191.832 |
| Q3 | 240.409 |
| 95-th percentile | 326.5014 |
| Maximum | 495.561 |
| Range | 494.2784 |
| Interquartile range (IQR) | 98.14 |
Descriptive statistics
| Standard deviation | 74.685712 |
|---|---|
| Coefficient of variation (CV) | 0.38554916 |
| Kurtosis | -0.11237649 |
| Mean | 193.71255 |
| Median Absolute Deviation (MAD) | 49.037 |
| Skewness | 0.22878544 |
| Sum | 3662135.8 |
| Variance | 5577.9556 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 209.954 | 3 | < 0.1% |
| 195.287 | 3 | < 0.1% |
| 116.737 | 3 | < 0.1% |
| 246.013 | 3 | < 0.1% |
| 186.828 | 3 | < 0.1% |
| 295.34 | 3 | < 0.1% |
| 185.909 | 3 | < 0.1% |
| 216.032 | 3 | < 0.1% |
| 185.927 | 3 | < 0.1% |
| 100.395 | 3 | < 0.1% |
| Other values (18427) | 18875 |
| Value | Count | Frequency (%) |
| 1.2826 | 1 | |
| 5.5449 | 1 | |
| 5.5922 | 1 | |
| 5.6998 | 1 | |
| 5.7456 | 1 | |
| 6.564 | 1 | |
| 6.6852 | 1 | |
| 9.1574 | 1 | |
| 13.1108 | 1 | |
| 14.0229 | 1 |
| Value | Count | Frequency (%) |
| 495.561 | 1 | |
| 466.4078 | 1 | |
| 450.953 | 1 | |
| 450.402 | 1 | |
| 450.349 | 1 | |
| 448.0295 | 1 | |
| 446.488 | 1 | |
| 438.901 | 1 | |
| 438.8574 | 1 | |
| 437.477 | 1 |
class
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 295.4 KiB |
| g | |
|---|---|
| h |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 18905 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | g |
|---|---|
| 2nd row | g |
| 3rd row | g |
| 4th row | g |
| 5th row | g |
Common Values
| Value | Count | Frequency (%) |
| g | 12332 | |
| h | 6573 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| g | 12332 | |
| h | 6573 |
Most occurring characters
| Value | Count | Frequency (%) |
| g | 12332 | |
| h | 6573 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18905 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| g | 12332 | |
| h | 6573 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18905 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| g | 12332 | |
| h | 6573 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18905 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| g | 12332 | |
| h | 6573 |
Auto
The auto setting is an interpretable pairwise column metric of the following mapping:- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.| fLength | fWidth | fSize | fConc | fConc1 | fAsym | fM3Long | fM3Trans | fAlpha | fDist | class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 28.7967 | 16.0021 | 2.6449 | 0.3918 | 0.1982 | 27.7004 | 22.0110 | -8.2027 | 40.0920 | 81.8828 | g |
| 1 | 31.6036 | 11.7235 | 2.5185 | 0.5303 | 0.3773 | 26.2722 | 23.8238 | -9.9574 | 6.3609 | 205.2610 | g |
| 2 | 162.0520 | 136.0310 | 4.0612 | 0.0374 | 0.0187 | 116.7410 | -64.8580 | -45.2160 | 76.9600 | 256.7880 | g |
| 3 | 23.8172 | 9.5728 | 2.3385 | 0.6147 | 0.3922 | 27.2107 | -6.4633 | -7.1513 | 10.4490 | 116.7370 | g |
| 4 | 75.1362 | 30.9205 | 3.1611 | 0.3168 | 0.1832 | -5.5277 | 28.5525 | 21.8393 | 4.6480 | 356.4620 | g |
| 5 | 51.6240 | 21.1502 | 2.9085 | 0.2420 | 0.1340 | 50.8761 | 43.1887 | 9.8145 | 3.6130 | 238.0980 | g |
| 6 | 48.2468 | 17.3565 | 3.0332 | 0.2529 | 0.1515 | 8.5730 | 38.0957 | 10.5868 | 4.7920 | 219.0870 | g |
| 7 | 26.7897 | 13.7595 | 2.5521 | 0.4236 | 0.2174 | 29.6339 | 20.4560 | -2.9292 | 0.8120 | 237.1340 | g |
| 8 | 96.2327 | 46.5165 | 4.1540 | 0.0779 | 0.0390 | 110.3550 | 85.0486 | 43.1844 | 4.8540 | 248.2260 | g |
| 9 | 46.7619 | 15.1993 | 2.5786 | 0.3377 | 0.1913 | 24.7548 | 43.8771 | -6.6812 | 7.8750 | 102.2510 | g |
| fLength | fWidth | fSize | fConc | fConc1 | fAsym | fM3Long | fM3Trans | fAlpha | fDist | class | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 19010 | 32.4902 | 10.6723 | 2.4742 | 0.4664 | 0.2735 | -27.0097 | -21.1687 | 8.4813 | 69.1730 | 120.6680 | h |
| 19011 | 79.5528 | 44.9929 | 3.5488 | 0.1656 | 0.0900 | -39.6213 | 53.7866 | -30.0054 | 15.8075 | 311.5680 | h |
| 19012 | 31.8373 | 13.8734 | 2.8251 | 0.4169 | 0.1988 | -16.4919 | -27.1448 | 11.1098 | 11.3663 | 100.0566 | h |
| 19013 | 182.5003 | 76.5568 | 3.6872 | 0.1123 | 0.0666 | 192.2675 | 93.0302 | -62.6192 | 82.1691 | 283.4731 | h |
| 19014 | 43.2980 | 17.3545 | 2.8307 | 0.2877 | 0.1646 | -60.1842 | -33.8513 | -3.6545 | 78.4099 | 224.8299 | h |
| 19015 | 21.3846 | 10.9170 | 2.6161 | 0.5857 | 0.3934 | 15.2618 | 11.5245 | 2.8766 | 2.4229 | 106.8258 | h |
| 19016 | 28.9452 | 6.7020 | 2.2672 | 0.5351 | 0.2784 | 37.0816 | 13.1853 | -2.9632 | 86.7975 | 247.4560 | h |
| 19017 | 75.4455 | 47.5305 | 3.4483 | 0.1417 | 0.0549 | -9.3561 | 41.0562 | -9.4662 | 30.2987 | 256.5166 | h |
| 19018 | 120.5135 | 76.9018 | 3.9939 | 0.0944 | 0.0683 | 5.8043 | -93.5224 | -63.8389 | 84.6874 | 408.3166 | h |
| 19019 | 187.1814 | 53.0014 | 3.2093 | 0.2876 | 0.1539 | -167.3125 | -168.4558 | 31.4755 | 52.7310 | 272.3174 | h |